Goto

Collaborating Authors

 health score


Enterprise Disk Drive Scrubbing Based on Mondrian Conformal Predictors

arXiv.org Artificial Intelligence

Disk scrubbing is a process aimed at resolving read errors on disks by reading data from the disk. However, scrubbing the entire storage array at once can adversely impact system performance, particularly during periods of high input/output operations. Additionally, the continuous reading of data from disks when scrubbing can result in wear and tear, especially on larger capacity disks, due to the significant time and energy consumption involved. To address these issues, we propose a selective disk scrubbing method that enhances the overall reliability and power efficiency in data centers. Our method employs a Machine Learning model based on Mondrian Conformal prediction to identify specific disks for scrubbing, by proactively predicting the health status of each disk in the storage pool, forecasting n-days in advance, and using an open-source dataset. For disks predicted as non-healthy, we mark them for replacement without further action. For healthy drives, we create a set and quantify their relative health across the entire storage pool based on the predictor's confidence. This enables us to prioritize selective scrubbing for drives with established scrubbing frequency based on the scrub cycle. The method we propose provides an efficient and dependable solution for managing enterprise disk drives. By scrubbing just 22.7% of the total storage disks, we can achieve optimized energy consumption and reduce the carbon footprint of the data center.


Developing Hybrid Machine Learning Models to Assign Health Score to Railcar Fleets for Optimal Decision Making

arXiv.org Artificial Intelligence

Developing Hybrid Machine Learning Models to Assign Health Score to Railcar Fleets for Optimal Decision Making Mahyar Ejlali, Ebrahim Arian, Sajjad Taghiyeh, Kristina Chambers, Amir Hossein Sadeghi, Demet Cakdi, Robert B Handfield An expert hybrid predictive fault method is proposed based on fast-DBSCAN and PCA. Inspection data from 1986-2020 of North American Railcar Owner (NARO) is used. The model is able to predict future faults in the railcar fleet accurately. Abstract A large amount of data is generated during the operation of a railcar fleet, which can easily lead to dimensional disaster and reduce the resiliency of the railcar network. To solve these issues and offer predictive maintenance, this research introduces a hybrid fault diagnosis expert system method that combines density-based spatial clustering of applications with noise (DBSCAN) and principal component analysis (PCA). Firstly, the DBSCAN method is used to cluster categorical data that are similar to one another within the same group. Secondly, PCA algorithm is applied to reduce the dimensionality of the data and eliminate redundancy in order to improve the accuracy of fault diagnosis. Finally, we explain the engineered features and evaluate the selected models by using the Gain Chart and Area Under Curve (AUC) metrics. We use the hybrid expert system model to enhance maintenance planning decisions by assigning a health score to the railcar system of the North American Railcar Owner (NARO). According to the experimental results, our expert model can detect 96.4% of failures within 50% of the sample. This suggests that our method is effective at diagnosing failures in railcars fleet. Keywords: Expert system, Predictive maintenance, Railcar maintenance, Machine learning, Maintenance health score 1. Introduction Maintenance consists of activities that ensure the railcar assets continue to operate safely and reliably. These activities include inspection, repair, testing, and replacement of parts.


Connecting the Business to Data Science with ModelOps

#artificialintelligence

The inner workings of data science--much like that of multi-parameter, opaque machine learning models--have traditionally been an enigma to the average business user. While the latter simply desires accurate predictions to do his or her job better, how exactly cognitive computing aids this objective, and if it actually is doing so, has rarely been clear to these professionals. Will all the media fervor and enterprise spending on statistical applications of Artificial Intelligence, it's no longer acceptable for organizations to continue investing in data science without some assurance of the impact, positive or otherwise, their initiatives are producing. According to Datatron CEO Harish Doddi, "Over the last few years, so many organizations have invested in AI talent. 'What is the ROI of these models?' is the question that's coming from the business."


The Healthy States of America: Creating a Health Taxonomy with Social Media

arXiv.org Artificial Intelligence

Since the uptake of social media, researchers have mined online discussions to track the outbreak and evolution of specific diseases or chronic conditions such as influenza or depression. To broaden the set of diseases under study, we developed a Deep Learning tool for Natural Language Processing that extracts mentions of virtually any medical condition or disease from unstructured social media text. With that tool at hand, we processed Reddit and Twitter posts, analyzed the clusters of the two resulting co-occurrence networks of conditions, and discovered that they correspond to well-defined categories of medical conditions. This resulted in the creation of the first comprehensive taxonomy of medical conditions automatically derived from online discussions. We validated the structure of our taxonomy against the official International Statistical Classification of Diseases and Related Health Problems (ICD-11), finding matches of our clusters with 20 official categories, out of 22. Based on the mentions of our taxonomy's sub-categories on Reddit posts geo-referenced in the U.S., we were then able to compute disease-specific health scores. As opposed to counts of disease mentions or counts with no knowledge of our taxonomy's structure, we found that our disease-specific health scores are causally linked with the officially reported prevalence of 18 conditions.


'Many' ways to create artificial intelligence. Just ask the UK's AI businesses

#artificialintelligence

Nothing brings a smile to the face of Sabine Toulson – co-founder in 1995 of Intelligent Financial Systems – faster than the notion that AI and its associated technologies are "something new". Both Sabine and husband Darren were graduates of UCL's Artificial Intelligence Lab – alongside other veteran entrepreneurs such as Jason Kingdon, who founded UCL spinout Searchspace, which was famous at the time for the quality of its anti-money laundering software. Searchspace has been using machine learning techniques for years to combat money laundering, employing tools that compared millions of transactions and distinguished between legitimate and fraudulent transactions between buyers and sellers. Like Searchspace, Intelligent Financial Systems (IFS) succeeded early in cracking the difficult US financial software market. Back in 2000, the company won a contract to study and analyse the enormous volumes of data emerging daily from the Chicago Board of Trade.